Performance Assessment and Interpretation of Random Forests by Three-dimensional Visualizations
نویسندگان
چکیده
Ensemble learning techniques and in particular Random Forests have been one of the most successful machine learning approaches of the last decade. Despite their success, there exist barely suitable visualizations of Random Forests, which allow a fast and accurate understanding of how well they perform a certain task and what leads to this performance. This paper proposes an exemplar-driven visualization illustrating the most important key concepts of a Random Forest classifier, namely strength and correlation of the individual trees as well as strength of the whole forest. A visual inspection of the results enables not only an easy performance evaluation but also provides further insights why this performance was achieved and how parameters of the underlying Random Forest should be changed in order to further improve the performance. Although the paper focuses on Random Forests for classification tasks, the developed framework is by no means limited to that and can be easily applied to other tree-based ensemble learning methods.
منابع مشابه
Multi-Temporal Assessment of Mangrove Forests Change in the Coastal Areas of Bushehr Region Based on Landsat Satellite Imagery
Continual access to precise information about the land use/land cover (LULC) changes of the Earth’s surface is extremely important for any sustainable development program in which LULC serves as one of the major input criteria. In this study, a supervised classification was applied to three Landsat images collected in 1986, 1998and 2018, providing mangrove forests change data in the coastal are...
متن کاملThree-Dimensional Color Doppler Ultrasonography Study of Normal Liver Vascular Pattern in Dog
Objective- To create three-dimensional model of the dog's liver vessels by using threedimensionalcolor Doppler ultrasonography which can be used for surgery planning,tumor detecting, transplantation, and other diagnostic or treatment project.Design- Descriptive studyAnimals- 6 mixed breed dog, 1.6-1.7 years old, 18-20 kg weightProcedures- The liver was found by two-dimensional scan initially th...
متن کاملAssessment of protected vs. degraded oak forests: A geostatistical approach based on soil and plant diversity
Assessment of forest soil and vegetation characteristics provides basic and essential information for the protection and rehabilitation measures in forest ecosystems. Therefore, regard to the importance of this issue, the distribution of different soil properties and vegetation diversity in relation to conservation management and degradation investigated in the oak forests of Ilam province usin...
متن کاملDrawing Georeferenced Graphs - Combining Graph Drawing and Geographic Data
DATA VISUALIZATION Full Papers A Linear Time Algorithm for Visualizing Knotted Structures in 3 Pages Vitaliy Kurlin Supporting Event-based Geospatial Anomaly Detection with Geovisual Analytics Orland Hoeber and Monjitr Ul Hasan The Stor-e-Motion Visualization for Topic Evolution Tracking in Text Data Streams Andreas Weiler, Michael Grossniklaus and Marc H. Scholl The Visual Exploration of Aggre...
متن کاملIntroducing the improved Forest Canopy density (FCD) model for frequent assessment of Hyrcanian forest
Mapping of forest extent is a prerequisite to acquire quantitative and qualitative information about forests and to formulate management and conservation strategies. forest canopy density (FCD) model is one of the useful RS methods for forest mapping using satellite images. One of the most serious challenges in FCD model is the weakness in the calculation of canopy density in low density forest...
متن کامل